Standard Deviation as a Query Hardness Estimator
نویسندگان
چکیده
In this paper a new Query Performance Prediction method is introduced. This method is based on the hypothesis that different score distributions appear for ‘hard’ and ‘easy’ queries. Following we propose a set of measures which try to capture the differences between both types of distributions, focusing on the dispersion degree among the scores. We have applied some variants of the classic standard deviation and have studied methods to find out the most suitable size of the ranking list for these measures. Finally, we present the results obtained performing the experiments on two different data-sets.
منابع مشابه
Wavelet Threshold Estimator of Semiparametric Regression Function with Correlated Errors
Wavelet analysis is one of the useful techniques in mathematics which is used much in statistics science recently. In this paper, in addition to introduce the wavelet transformation, the wavelet threshold estimation of semiparametric regression model with correlated errors with having Gaussian distribution is determined and the convergence ratio of estimator computed. To evaluate the wavelet th...
متن کاملAlgorithms based on sparsity hypotheses for robust estimation of the noise standard deviation in presence of signals with unknown distributions and occurrences
Inmany applications, d-dimensional observations result from the randompresence or absence of random signals in independent and additive white Gaussian noise. An estimate of the noise standard deviation can then be very useful to detect or to estimate these signals, especially when standard likelihood theory cannot apply because of too little prior knowledge about the signal probability distribu...
متن کاملA Study on the Class of Chain Ratio–type Estimators
This paper considers the problem of estimating the population mean Ybar of the study variate Y using information on different parameters such as population mean $(bar{X})$, coefficient of variation $(C_x)$, kurtosis $beta_{2(x)}$, standard deviation $(S_x)$ of the auxiliary variate x and on the correlation coefficient, $rho$, between the study variate $Y$ and the auxiliary variate $...
متن کاملA Two-Phase Robust Estimation of Process Dispersion Using M-estimator
Parameter estimation is the first step in constructing any control chart. Most estimators of mean and dispersion are sensitive to the presence of outliers. The data may be contaminated by outliers either locally or globally. The exciting robust estimators deal only with global contamination. In this paper a robust estimator for dispersion is proposed to reduce the effect of local contamination ...
متن کاملAlgorithms and Applications for Estimating the Standard Deviation of AWGN when Observations are not Signal-Free
— Consider observations where random signals are randomly present or absent in independent and additive white Gaussian noise (AWGN). By using a recently established limit theorem, we introduce a new estimator for the estimation of the noise standard deviation when the signals are less present than absent and have unknown probability distributions. The bias, the consistency and the minimum attai...
متن کامل